Particle Filter Based Soft-mask Estimation for Missing Feature Reconstruction
نویسندگان
چکیده
In this work, we show how particle filter (PF) based speech feature enhancement can profitably be combined with soft-decision missing feature reconstruction. The combined approach is motivated by the fact that standard minimum mean square error noise compensation techniques fail to give accurate estimates of the clean speech spectrum if the noise spectral power significantly exceeds that of speech in a particular spectral region. Experiments show that the proposed algorithm can reduce the word error rate by up to 26.1% relative, compared to 17.0% for speech feature enhancement based solely on particle filters.
منابع مشابه
3D Reconstruction from Stereo Camera Dynamic Image based on Particle Filter
Abstract: A new method for 3D reconstruction from dynamic stereo image by using state estimation with particle filter is proposed. Associations of feature points between two images and 3D position of the feature points are estimated simultaneously by particle filter. It is due to the applicability of particle filter for nonlinear and non-Gaussian state space model including unknown associations...
متن کاملMask estimation in non-stationary noise environments for missing feature based robust speech recognition
In missing feature based automatic speech recognition (ASR), the role of the spectro-temporal mask in providing an accurate description of the relationship between target speech and environmental noise is critical for minimizing the degradation in ASR word accuracy (WAC) as the signal-to-noise ratio (SNR) decreases. This paper demonstrates the importance of accurate characterization of instanta...
متن کاملEnvironment-independent mask estimation for missing-feature reconstruction
In this paper, we propose an effective mask-estimation method for missing-feature reconstruction in order to achieve robust speech recognition in unknown noise environments. In previous work, it was found that training a model for mask estimation on speech corrupted by white noise did not provide environment-independent recognition accuracy. In this paper we describe a training method based on ...
متن کاملNeural Network Based Missing Feature Method For Text-Independent Speaker Identification
The first step of missing feature methods in text-independent speaker identification is to identify highly corrupted spectrographic representation of speech as missing feature. Most mask estimation techniques rely on explicit estimation of the characteristics of the corrupting noise and usually fail to work with inaccurate estimation of noise. We present a mask estimation technique that uses ne...
متن کاملSoft missing-feature mask generation for simultaneous speech recognition system in robots
This paper addresses automatic soft missing-feature mask (MFM) generation based on a leak energy estimation for a simultaneous speech recognition system. An MFM is used as a weight for probability calculation in a recognition process. In a previous work, a threshold-base-zero-or-one function was applied to decide if spectral parameter can be reliable or not for each frequency bin. The function ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008